Qespera: an adaptive framework for prediction of queue waiting times in supercomputer systems
نویسندگان
چکیده
Production parallel systems are space-shared, and resource allocation on such systems is usually performed using a batch queue scheduler. Jobs submitted to the batch queue experience a variable delay before the requested resources are granted. Predicting this delay can assist users in planning experiment time-frames and choosing sites with less turnaround times and can also help meta-schedulers make scheduling decisions. In this paper, we present an integrated adaptive framework, Qespera, for prediction of queue waiting times on parallel systems. We propose a novel algorithm based on spatial clustering for predictions using history of job submissions and executions. The framework uses adaptive set of strategies for choosing either distributions or summary of features to represent the system state and to compare with history jobs, varying the weights associated with the features for each job prediction, and selecting a particular algorithm dynamically for performing the prediction depending on the characteristics of the target and history jobs. Our experiments with real workload traces from different production systems demonstrate up to 22% reduction in average absolute error and up to 56% reduction in percentage prediction error over existing techniques. We also report prediction errors of less than 1 h for a majority of the jobs. Copyright © 2015 John Wiley & Sons, Ltd.
منابع مشابه
Identifying Quick Starters: Towards an Integrated Framework for Efficient Predictions of Queue Waiting Times of Batch Parallel Jobs
Production parallel systems are space-shared and hence employ batch queues in which the jobs submitted to the systems are made to wait before execution. Thus, jobs submitted to parallel batch systems incur queue waiting times in addition to the execution times. Prediction of these queue waiting times is important to provide overall estimates to the users and can also help metaschedulers make sc...
متن کاملPrediction of Queue Waiting Times for Metascheduling on Parallel Batch Systems
Prediction of queue waiting times of jobs submitted to production parallel batch systems is important to provide overall estimates to users and can also help meta-schedulers make scheduling decisions. In this work, we have developed a framework for predicting ranges of queue waiting times for jobs by employing multi-class classification of similar jobs in history. Our hierarchical prediction st...
متن کاملRefueling problem of alternative fuel vehicles under intuitionistic fuzzy refueling waiting times: a fuzzy approach
Using alternative fuel vehicles is one of the ways to reduce the consumption of fossil fuels which have many negative environmental effects. An alternative fuel vehicle needs special planning for its refueling operations because of some reasons, e.g. limited number of refueling stations, uncertain refueling queue times in the stations, variable alternative fuel prices among the stations, etc. I...
متن کاملAppointment process of open-heart surgery in Shahid Rajaei Heart Hospital using a discrete event simulation approach
Background: The waiting list is a list of selected patients in the surgical queue. If demand exceeds capacity, the waiting list grows rapidly, which may lead to unacceptable waiting for patients, especially those in need of acute medical care. Patients waiting for heart surgery are placed on the waiting list for surgery, and sometimes the waiting time is longer than patients expect. Reducing th...
متن کاملMultiple vacation policy for MX/Hk/1 queue with un-reliable server
This paper studies the operating characteristics of an MX/Hk/1 queueing system under multiple vacation policy. It is assumed that the server goes for vacation as soon as the system becomes empty. When he returns from a vacation and there is one or more customers waiting in the queue, he serves these customers until the system becomes empty again, otherwise goes for another vacation. The brea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Concurrency and Computation: Practice and Experience
دوره 28 شماره
صفحات -
تاریخ انتشار 2016